Real-time Learning when Concepts Shift

ثبت نشده
چکیده

We are interested in real-time learning problems where the underlying stochastic process, which generates the target concept, changes over time. We want our learner to detect when a change has occurred, thus realizing that the learned concept no longer fits the observed data. Our initial approach to this problem has been to analyze offline approaches to addressing concept shifts and to apply them to real-time problems. This work involves the application of the Minimum Description Length principle to detecting real-time concept shifts. Introduction If enough consistent data can be obtained, standard machine learning algorithms can be applied to most learning problems in an offline, batch learning approach [5, 6, 7]. Our interest is in an online, sequential learning process, where the probabilistic functions that are generating the attributes and classifications of the world are changing over time. In the classic supervised learning problem [5, 9], it is generally stated that some stochastic function F(x) is generating an attribute vector x, based on a fixed probability distribution. The attribute vector, x, represents the salient features of the world. In a batch learning mode, the learner is given preclassified training examples based on a conditional distribution function F(y|x), where the values that y can take on are the classification values that would be associated with an instantiation of the attribute vector x. This function is unknown to the learner. Consequently, it is the job of the learner to attempt to approximate this function, F(y|x), as best it can by observing the training values and applying a learning algorithm. The online, sequential learning process presents several added difficulties. For example, since the learner is receiving it’s training data sequentially; it will need to repeatedly apply the learning algorithm until it is satisfied that it has converged to a good model of the world. This means that it has to store all of the previously encountered training vectors in some usable form. Due to the enormity of the datasets, some summarization technique must be used that does not sacrifice valuable information [3]. This summarization technique must also be chosen so that it does not inappropriately bias the next learning iteration toward a previously learned model. Our ultimate goal is to address uncertainty in real-time distributed computing problems with agent-based solutions. At the core of these agent-based solutions would be a realtime learning component that can detect and address shifts in the underlying target concept. Our initial approach to this problem has been to analyze offline approaches that address concept shifts and to apply them to real-time problems. Our first effort involves the application of the Minimum Description Length principle to detecting concept shifts. Sliding Window A current method used to address concept drift is the sliding window [4, 10] approach (figure 1). The idea is for the learner to only consider the most recent n observed examples when learning the concept. There are a number of parameters the learner must consider, such as the size of the window and whether the window size is adaptive or stable. The size of the window is of importance in that a small window would compromise the confidence in the learned concept. Learning a concept on a small data sample can easily lead to overfitting the learned concept to small aberrations in the data or simply overlooking entire elements of the target concept. Conversely, large window sizes lead to resource issues, in real time systems and may also leave the learned concept overly vulnerable to drift in the target concept. If the window size is large enough so that the range of enclosed attribute vectors drifts significantly, then the learned concept will perform poorly in a predictive capacity. Lastly, it seems that the sliding window approach is much better suited to concept drift, rather than concept shift. Concept drift occurs when the target concept changes gradually, while concept shift occurs when the target concept changes instantly. Instantaneous change in the target concept would leave the learned concept open to error until the entire window was drawn from the new target concept. Problem Domain The goal of our research is to create highly adaptable, real-time distributed computer systems. The benefits of this work could be realized through improved network data transmission speeds, greater autonomous control and coordination in satellite constellations and robot applications, and greater reliability in wireless communication, just to name a few. Even under controlled conditions, distributed computing issues, such as maintaining global state and determining causality among a series of events, are fraught with difficulty. There are known algorithms for addressing these problems, however they are designed to work in environments where communication reliability is guaranteed and processors never fail. These algorithms quickly fail when a distributed computing application is introduced into a volatile environment, in which communication links may be unstable, the number of nodes may change, or adversaries may introduce errors into the communication channel. Our research addresses the need for autonomous adaptability in real-time environments by introducing specialized intelligent agent technology into the distributed computing arena. Using this technology, we are providing a method for coping with uncertainty in order to address real-time adaptability issues. Time Figure 1. Addressing real-time concept drift/shift with a sliding window approach. The learner only considers the most recent n attribute vectors. Current attribute

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Real-Time Learning when Concepts Shift

We are interested in real-time learning problems where the underlying stochastic process, which generates the target concept, changes over time. We want our learner to detect when a change has occurred, thus realizing that the learned concept no longer fits the observed data. Our initial approach to this problem has been to analyze offline methods for addressing concept shifts and to apply them...

متن کامل

Using Machine Learning for Exploratory Data Analysis

This tutorial will introduce attendees to fundamental concepts in the clustering and dimensionality reduction fields of unsupervised machine learning. Attendees will learn about the assumptions algorithms make and how those assumptions can cause the algorithms to be more or less suited to particular datasets. Hands-on interaction with machine learning algorithms on real and synthetic data are a...

متن کامل

Using Context-Aware Ubiquitous Learning to Support Students in Learning Angle Concepts

Mobile learning (mlearning) offers many new opportunities in the evolution of technology enhanced learning (Looi et al., 2010). These technologies are seeping into educational settings as the affordances are becoming recognized for the way in which they extend the pedagogical boundaries. Mlearning can provide a shift from the abstract concepts to the contextualized. In other words, students can...

متن کامل

Collaborative learning aspects in VSE

In the last years the attention for the learning methodologies has produced a shift from simply transmitting the knowledge to actually building knowledge (Vrasidas, Charalambos, 2000). The active involvement, the reciprocal collaboration among the learners, the concrete experimentation in the learning process are become the main features to consider in a learning process (Merrill, 2001). In fac...

متن کامل

Generalization of Canonical Correlation Analysis from Multivariate to Functional Cases and its related problems

In multivariate cases, the aim of canonical correlation analysis (CCA) for two sets of variables x and y is to obtain linear combinations of them so that they have the largest possible correlation. However, when x and y are continouse functions of another variable (generally time) in nature, these two functions belong to function spaces which are of infinite dimension, and CCA for them should b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000